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ABSTRACT 

The primary purpose of the present study was to 
compare three measures of recognition memory. One hundred and 
forty-four high school students were instructed .to study silently 36 
CVCVC verbal units (target items) at a 4-sec, rate of presentation. 
The Ss were then given one of three tasks which required them to 
identify target items from among distractor items (also CVCVC verbal 
units) , The tasks were the Embedded Item, Multiple Choice, and 
Sbepard-Teghtsoonian, Results demonstrated that the Multiple Choice 
task yielded the greatest number of correct recognitions, lowest 
number of false recognitions, and lowest number of nonrecognitions 
when compared with the other two recognition tasks. Although scores 
on each of the three tasks correlated with meaningf ulness (M) , 
associative reaction time (RT) , and pronunciability ratings (PR) of 
the target items, the magnitude of the coefficients for M and RT were 
almost identical (rs ranged from .41 to .67) but greater than the 
coefficients between PR and recognition scores (rs ranged from ,36 to 
.59). The magnitude of the coefficients for the assessment values was 
greatest for the Embedded Item task and least for the 
Shepard-Teghtsoonian , (Author) 
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Abstract 

Measures of Short-Term Recognition Memory 

Ronald Ley and Jurgen Karker 

State University of New York at Albany 

(Schedioled for presentation at the Annual Meeting of the Psychonoraic 
Society, St. Louis, Missouri, November, 1973.) 

The primary purpose of the present study was to compare three 
measures of recognition memory. One hundred and forty-four high school 
students were instructed to study silently 36 CVCVC verbal \inits (target 
items) at a 4-sec. rate of presentation • The Ss were then given one of 
three tasks which required them to identify target items from among dis- 
tractor items (also CVCVC verbal iinits). The tasks were the Embedded 
Item, Multiple Choice, and Shepard-Teghtsoonian. Results demonstrated 
that the Multiple Choice task yielded the greatest number of correct 
recognitions, lowest number of false recognitions, and lowest number of 
nonrecognitions when compsred with the other two recognition tasks. 

Although scores on each of the three tasks correlated with meaning- 
fulness (M), associative reaction time (RT), and pronunciability ratings 
(PR) of the target items, the magnitude of the coefficients for M and RT 
were almost identical {xs ranged from .hi to ,6?) but greater than the 
coefficients between PR and recognition scores (rs ranged from .36 to 
.59). The magnitude of the coefficients for the assessment values was 
greatest for the Embedded Item task and least for the Shepard-Teghtsoonian. 
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Measures of Short-Term Recognition Memory*" 
Ronald Ley and Jurgen Karker 
State University of New York at Albany 

Current research in retention has focused on recognition and recall 
as separate memorial processes, \ftiile methods of measuring recall are 
relatively uniform, methods of measuring recognition are less so. The 
three methods meet commonly used to measure recognition memory are: the 
Embedded Item task (El), the Multiple Choice task (MC), and the Shepard- 
Teghtsoonian task (ST). The EI task typically has a format of verbal 
units appearing in columnwise fashion on one or more test pages.. The 
task requires £ to review the relatively large list and identify ^rerbal 
vuiits (target items) which were presented during an earlier study period, 
from among those units which were not presented during the study period 
(distractor items). Distractor items usually outniomber target by a ratio 
of at least 2:1. The MC task requires S_ to select the target item from 
among a relatively small set of verbal units, e.g., one target item per 
set of two to four distractors. The ST task requires S to sort a deck of 
ciirds in which each card contains one verbal unit. Subjects are required 
to sort target-item cards, v/here the number of distractor cards to target 
cards is in the ratio of about 2:1. 

The primary purpose of the present study was .^o compare these three 
methods of measuring recognition v/ith respect to performance scores 
(correct recognitions, false recognitions, and nonrecognitions) . 

The secondary purpose of the present study was to determine whether 
the three methods of measuring recognition memory interact with academic 
ability. That is, if recognition memory is a significant factor in aca- 
demic achievement, high ability sttidents would be expected to make more 
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correct recognitions, fev;er false recognitions, and fewer nonrecognitions 
than low ability students. However, it is conceivable that part of the 
expected difference in recognition performance between high and low 
ability students may be a function of the method used for measuring recog- 
nition memory. This possibility has important implications for research 
designed to tes^ theories of recognition memory and research designed to 
measure academic achievement or intellectual aptitude in terms of recogni- 
tion memory scores (e.g., multiple choice tests). 

The tertiary purpose of the present study was to compare three 
assessment characteristics (associative reaction time--RT, meaningfulness- 
M,' and pronimciability ratings—PR) of the verbal units used in the study 
with respect to their relative capabilities for predicting correct recog- 
nition scores of the high and low ability groups under each of the three 
recognition tasks. 

Method 

Subjects , — The lUh Ss were hig'n school seniors, 72 of whom were ran- 
domly selected from the top third of their class with respect to grades 
and 72 of vrhom were randomly selected from the bottom third of their class 
All Ss were naive with respect to previous experience in verbal learning 
studies. 

Materials . — Thirty-six target items vrere randomly selected from the 
list of 319 CVCVC words and paralogs assessed for associative reaction 
time (RT) by Taylor and Kimble (I967), for meaningfulness (M— scaled- 
rated) ty Locascio and Ley (1972), and for pronunciability (PR) by Ley 
and Barker (1973) • The 72 distractor items were selected from the reriiain- 
ing 283 units of this list plus the CVCVC units of Andreas* (1972) list. 
The target and distractor items were randomly selected with the single 
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.restriction that none of the items shared formal similarity with respect 
to the first three letters of the CVCVC units, i.e., no two of the first 
three letters of a uiiit appear more than once in the entire list of I08 
items (36 targets and 72 distractors) . 

The materials for the EI task consisted of a single sheet of 8^- by 
11-in, paper on which four colijmns of 27 units each were printed. The 
column and row location of each target and distractor item was randomly 
determined. 

The niaterials for the MC task consisted cf U8 decks of 36 Hollerith 
computer cards. Each card contained one target item and two distractor 
items printed on the top row of the card,. The ordinal position of the 
target and distractor items on each card was randomly determined. 

The materials for the ST task consisted of U8 decks of 108 Hollerith 
computer cards . Each card contained either a single target item or a 
single distractor item printed on the top row of the card. 

The materials used for the presentation of the target items during 
the study period consisted of 36 35-nim, slides each one of which contained 
a single target item. The slides were projected on a film screen by means 
of a Kodak Carousel slide projector. Presentation rate was controlled by 
the internal timer of the projector. 

Procedure . --During the study phase, Ss v/ere seated facing the film 
screen. All Ss received uniform instructions which required that they 
silently rehearse each unit as it appeared on the screen. The Ss were 
given four practice trials for the purpose of familiarizing them with the 
types of verbal units which were to be studied and the U-sec. presentation 
rate. 
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Following the study phase, S^s were given brief instructions relevant 
to their respective recognition task, i.e., S^s in the EI Group were re- 
quired to read down the typed sheet and i\o circle with a pencil those 
items presented dui*jug the study phase; S^s in the MC Group were required 
to circle that unit from among the three units printed on each card, 
which had been presented during the study phase; and S^s in the ST Group 
were required to sort their deck by placing cards containing items pre- 
sented during the study phase in one pile and cards containing other items 
in a second pile. All Ss were strongly discouraged from guessing. 

Results 

Three performance measures for each S under each of the three tasks 
were determined on the bases of (a) correct recognitions (total number 
of target items identified as "old"), (b) false recognitions (total num- 
ber of distractor items identified as "old"), and (c) nonrecognitions 
(total number of target items not identified as "old"). The mean numbers 
of correct recognitions i^or the high- and low-ability groups under each 
cf their re^^pective recognition task conditions are given in Table 1 
(n = 2k Ss per cell). 

Insert Table 1 about here 

Although the observed mean number of correct recognitions was greatest 
for the MC task and least for the ST task, a 3 X 2 analysis of variance 
(three tasks by two ability groups) indicated that differences among the 
tasks were not significant, F (2, 138) = 1.2U, £>.05; but differences 
between the groups favored the high ability £s, F (1, 138) = 63.09, £ < 
.001. The Task X Ability Group interaction was not significant, F (2, 
138) = .13, £>.05. 
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With respect to false recognition scores, the analysis of \Aariance 
indicated that the MC task resulted in the least numb'=5r of false re- 
cognitions and the ST in the most; F (l, 138) = 5.34, £<.01. 
Although the observed differences between the low and high ability Ss 
were relatively small, the low ability Ss made significantly fewer 
false recognitions than the high ability Ss; F (l, 138) = p <.001. 

The Task X Ability Group interaction was not significant; F (2, 138) = 
1.09, £>.05. 

With respect to nonrecognitibn scores, the analysis of variance 
indicated that the ST task resulted in the most nonrecognitions and 
the MC task in the least; F (2, 138) = 5.l6, £<.01, Although the 
observed difference in the mean number of nonrecognitions \inder the MC 
task favored the low ability Se, the overall test of the ability factor 
resulted in significantly fewer nonrecognitions for the high ability 
Ss; F (1, 138) = 5.82, £<.01. Furthermore, the Task X Ability Group 
interaction was not significant; F (2, 138) = .16, p> .05. 

Correlation coefficients between the three assessment characteristics 
of the target item^ and number of correct recognitions under the three 
tasks and two ability groups are given in Table 2. Although all of the 
correlations are significant, several distinct trends are apparent: 

Insert Table 2 about here 

(a) The differences between the correlation coefficients for the high and 
low ability groups within each of the recognition tasks are virtually 
nil — except for the PR by EI cells, where the correlation for the high 
ability group (£ = -.59) is considerably larger than that for the low 
ability group (£ = «,36). (b) Ttie size of the correlation coefficients 
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between the assessment characteristics and correct recognitions are 
ranked with respect to task, i.e., rs are largest iinder the EI task (ex- 
cept for the PR by low ability group, rs range from .59 to .6?) and 
smallest under the ST task (rs range from .hO to .53). (c) The correla- 
tion coefficients between RT and correct recognitions and M and correct 
recognitions are virtually the same under the EI and MC tasks and for each 
of the ability groups within each of these tasks. Under the ST task, M 
appears to be a better predictor than RT for both ability groups, (d) The 
correlations betvreen M and correct recognitions and RT and correct recog- 
nitions are larger than those between PR and correct recognitions under 
aLl three tasks and for both ability groups. 

Discussion 

The Embedded Item, Multiple Clioice, and Shepard-Teghtsoonian recogni- 
tion tasks provide very similar data when compared on the bases of correct 
recognitions. As expected, the high ability high school students made a 
greater number of correct recognit: Dns than the low ability students; and 

the differences betv/een the ability groups was almost iiniform among the 
three tasks. It should be noted that the variances of these scores were 
very similar; S.D.s ranged from U,50 to 5.95. 

The results of the analysis of the false recognitions indicated that 
the ST task yielded the largest number and the MC task the smallest. The 
exceptionally small number of false recognitions on the MC task coupled 
with the relatively small S.D.s suggest that the MC task may be subject 
to a floor effect. If false recognitions are of special interest to the 
researcher, the ST task would appear to be the most suitable. 

The results of the analysis of the differences between the ability 
groups on false recognitions under each of the three tasks indicated that 
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the low ability Ss made fewer false recognitions than the high ability Ss. 
This finding was unexpected. Since the Task-by-Ability-Group interaction 
was not significant it seems safe to conclude that this finding is not an 
artifact of the type of task^ It may be that this difference reflects a 
more conservative approach to test taking on the parts of the low ability 
Ss as compared to the high ability S^s, That is, low ability (low scholas-^ 
tic achievement) Ss by definition, have been incorrect on tests more fre- 
quently than high ability (high scholastic achievement) 3s. If subsequent 
information that "one's answer to a test question is wrong" constitutes 
punishment, then it might be expected that the more frequently punished 
S^s (low ability Ss) will require greater confidence in the correctness of 
their response before emiting the response. This hypothesis is partially 
corroborated by the nonrecognition scores, which indicated that the low 
ability Ss made more nonrecognitions than the high ability Ss. Although 
the Task-by-Ability-Group interaction was not significant, it should be 
noted that the nonrecognition scores under the MC task were in the opposite 
direction, i.e., the observed mean number of nonrecognitions was greater 
for the high ability Ss than the low. In this connection, it should also 
be noted that the significant Task factor on nonrecognitions was due al- 
most entirely to the relatively low number of nonrecognitions under the 
MC task, especially among the low ability Ss; whereas scores under the EI 
and ST task were very similar for both groups. 

With respect to the primary purpose of the present study, the com- 
parisons of the three recognition tasks siiggest: that: (a) they are about 
equally suited for the purpose of measuring correct recognitions, (b) the 
MC task, because of floor effects, may be unsuitable for measuring false 
recognitions, whereas the ST task is clearly most sensitive for this 
purpose, and (c) the MC task produces low nonrecognitions relative to 
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the EI and GT tasks. In vlev; of the floor effects on false recognition 
scores and the relatively low number of nonrecognitions, the MC task 
seems least suitable for measuring recognition memory^ Although the EI 
task produces correct recognitions and nonrecognitions equal to the ST 
task, it should be noted that false recognitions are almost equally low 
with respect to the MC task. With these considerations in mind, the best 
overall test of recognition appears to be the ST task. 

With respect to the secondary pxirpose of the present study, none of* 
the data indicated an interaction among the three measures of recognition 
and academic ability. The only suggestion of an interaction was found in 
the nonrecognitions, where scores of the low ability Ss were greater than 
those of the high ability Ss under the EI and ST task, but not the MC 
task; the observed means under the MC task were in fact in the opposite 
direction, i.e., scores of high ability Ss were greater than low ability 
Ss» This inconsistency with respect to the EI and ST tasks provides 
another reason for not recommending the MC task as a measure of recogni- 
tion memory. 

With respect to the tertiary purpose of the present s-uudy, the analy- 
sis of the correlations indicated that: (a) meaningf ulness (M) and asso- 
ciative reaction time (RT) were equally better predictors of correct re- 
cognitions under the EI and MC tasxcs than pronunciability ratings (PR), 
but M was a better predictor than both RT and PR under the ST task; (b) 
M and RT were most highly correlated with correct recognitions under the 
EI task and least under the ST task; (c) the correlations between PR and 
correct recognitions were uniformly moderate under all tasks except for 
the high ability £s under the EI task (r = -.59); (d) differences between 
the correlation coefficients of the high and low ability Ss within each 
task were remarkably similar, i.e^, except for the PR and EI task corre- 
Q lations (r = -.59 a^d r = -.36), the largest difference was between the 
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HSKsiB M by EI correlations (r = ,6? and r = .61), 
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Table 1 

Mean numbers of Correct Recognitions, 
False Recognitions, and Nonrecognitions 
for the High and Low Ability 
Groups Under Each 
Recognition Task 



Performance 


Academic 
Ability 


Recognition Task 


Meaeure 


Group 


Embedded Item ; 


Multiple 


Choice 


Shepard-Teghtsoonian 






Mean 


S.D. 


Mean 


S.D. 


Mean 


S.D. 


Correct 
Recognitions 


High 
Low 


27.38 
21,54 


5.52 
5.36 


29.91 
25.87 


5.60 
U.7I 


26.1+2 
22.88 


I+.50 
5.95 


False 

Recognitions 


High 
Low 


U.7I 
U.58 


3.35 
k.kl 


3.33 
2.66 


3.06 
2.88 


7.63 
6.29 


6.56 

1+.60 


Non- 
Recognitions 


High 
Low 


8.63 
lU.38 


5.36 
5.62 


5.^2 
3.88 


5.98 
U.72 


9.25 
12.88 


1+.75 
6.01+ 
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Table 2 

Product -Moment Correlation Coefficients Between 
Assessment Measures of the Target Items 
And Correct Recognition Scores 
Under the Three Recognition Tasks 
And Two Ability Groups (df=3^) 





Recognition Task 


Assessment 


Embedded 


Item 


Multiple 


Choice 


Shepard-Teghtsoonian 


Measure 


High 
Ability 


Lovr 

Ability 


High 
Ability 


Low 

Ability 


High 
Ability 


Low 

Ability 


Meaningfulness 


.67 


.61 


.52 


.55 


.U8 


.53 


Associative 
Reaction Time 


-.61+ 


-.62 


-.55 


r 0 


-.Ul 


-.U2 


Pronunciability 
Ratings 


-59 


-.36 




-.ko 


-.ko 


-.1+0 



Note.— r = .33, p = .025; r = .1+2, £ = .OO5. 
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